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^ ■ Abstract 
(N ■ 

wjrx' We present new approaches to constructing graph sparsifiers — weighted subgraphs for 

^ . which every cut has the same value as the original graph, up to a factor of (lie). Our first 

.^ ' approach independently samples each edge uv with probability inversely proportional to the 

^^ . edge-connectivity between u and v. The fact that this approach produces a sparsifier resolves 

^ a question posed by Bencziir and Karger (2002). Concurrent work of Hariharan and Panigrahi 

I— i. also resolves this question. Our second approach constructs a sparsifier by forming the union 

C^ ' of several uniformly random spanning trees. Both of our approaches produce sparsifiers with 

^^ . 0(nlog (n)/e^) edges. Our proofs are based on extensions of Karger's contraction algorithm, 

ly^ ' which may be of independent interest. 

^ ; 1 Introduction 

>: 

VO , Graph sparsification is an important technique in designing efficient graph algorithms. Different 

2~^ ' notions of graph sparsifiers have been considered in the hterature. Roughly speaking, given a graph 

Q ■ G = {V, E), a sparsifier G' of G is a sparse subgraph of G that approximates G in some measures, 

jX . e.g., pairwise distance, cut values, or the quadratic form defined by the graph Laplacian. G' may be 

^^ I weighted or not. Throughout this paper, we let n = \V\ and m = \E\. Since many graph algorithms 

^— ^ ' have running times that depend on m, if G is dense, then the running time can be improved by 

• • . replacing G with G', possibly with some loss in the quality of the solution. 

. !_^ I Let us define a cut sparsifier to be a weighted subgraph that approximately preserves the value 

^ ■ of every cut to within a multiplicative error of (lie). The main motivation for cut sparsifiers 

^ . was to improve the runtime of approximation algorithms for finding various sorts of cuts; indeed, 

they have been used extensively for this purpose [241 HI fH 127). The first cut sparsifier was Karger's 
graph skeleton [231 ED- He showed that sampling each edge independently with probability p = 
G(log n/e^c), where c is the size of the min cut, gives a sparsifier of size 0{pm). Unfortunately, 
this is of little use when c is small. The celebrated work of Bencziir and Karger JIJ 15] improved on 
this by using non-uniform sampling, obtaining a cut sparsifier with only 0(nlogn/e^) edges. Their 
sparsifier is constructed by randomly sampling every edge with probability inversely proportional 
to its edge strength, and weighting the sampled edges accordingly. 

Spielman and Teng [381 ES] define spectral sparsifiers — subgraphs that approximately preserve 
the quadratic form of the graph Laplacian. Such sparsifiers are stronger than the previously men- 
tioned sparsifiers that only preserve cuts. Spielman and Teng's motivation for studying spectral 
sparsifiers was to use them as a building block for algorithms that solve linear systems in near-linear 
time [381 I36j. They construct spectral sparsifiers with O(nlog'^n) edges, for some large constant 
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c. This was improved to 0{n\ogn/e'^) edges by Spielman and Srivastava [37j, by independently 
sampling edges according to their effective resistances. 

Spielman and Srivastava conjectured that there exist spectral sparsifiers with 0{n/e'^) edges. 
Towards that conjecture, Goyal, Rademacher and Vempala [H] showed that sampling just two 
random spanning trees gives a cut sparsifier in bounded-degree graphs and random graphs. Finally, 
in a remarkable paper, Batson, Spielman and Srivastava [2] construct spectral sparsifiers with only 
0{n/e^) edges. 

In this paper, we study several questions provoked by this previous work. 

• Bencziir and Karger ask: Does sampling according to edge connectivity instead of edge 
strength give a sparsifier? 

• The subgraph produced by Goyal, Rademacher and Vempala is an unweighted subgraph. If 
we sample random spanning trees and apply weights to the resulting edges, does this give a 
better sparsifier? 

• Are there other approaches to achieving sparsifiers with o{n log n) edges? 

In this paper, we give a positive answer to the first two questions. We also give a negative result 
on using random spanning trees to answer the third question. In concurrent, independent work, 
Hariharan and Panigrahi [20] also resolve the first question. 

1.1 Notation 

Before stating our results, we introduce some notation. For a multigraph G = {V, E) with edge 
weights u : E ^ M_|_ and a set F C i5^ of multiedges, the notation u{F) denotes X^egF^e- The 
notation \F\ denotes the total number of copies of all multiedges in F. For any set 5 C y, we 
define 5{S) to be the set of all copies of edges in E with exactly one end in S. So the notation 
u{5{S)) denotes the total weight of the cut 5{S). 

For an edge st G £', the (local) edge connectivity between s and t, denoted kst-, is defined to be 
the minimum weight of a cut that separates s and t. The effective conductance of edge st, denoted 
Cst, is the amount of current that flows when each edge e is viewed as a resistor of value Ue and 
a unit voltage difference is imposed between s and t. A k-strong component of G is a maximal 
fc-edge-connected, vertex-induced subgraph of G. The strength of edge st, denoted by k'^^, is the 
maximum value of k such that a A;-strong component of G contains both s and t. 

Informally, all three of kgt, Cgt and k'^^ measure the connectivity between s and t. The values 
of k'g-i and Cgt are incomparable: k'^i can be Vl{n) times larger than Cgt or vice versa. However 
kst > maxjcst, /c^i} always holds. For more details, see Appendix lAl 

1.2 Our results 

Theorem 1.1. Let G = {V,E) he a simple, weighted graph with edge weights u : E ^ Z+. Let 
e > 1/n and let p := dlog n where d = 0(l/e^). For each edge e, let Ke be a parameter such that 
K,e<k(.. With high probability, the graph G' produced by Algorithni{l\ satisfies 

\u{6{S)) - w{6{S))\ < e-u{6{S)) \/S<^V. (1.1) 

Furthermore, with high probability, 

\E'\ < 2p^|^ + 0(logn). (1.2) 

ee-B ^ 



Algorithm 1 A general algorithm for producing a sparsifier of G by sampling edges. 
procedure Sparsify(G, u, k) 

input: A graph G = {V,E) with edge weights u: E ^ Z+, and connectivity estimates k 
output: A graph G' = {V, E') with edge weights w : E ^f R+ 
For i = 1, . . . , p 

\> We refer to this as the i*^ round of sampling 
For each e & E 

For j = 1,... ,Ue 

With probability pe = 1/ne, add edge e to E' (if it does not already exist) and 
increase We by K^/p 
Return G' and w 



This theorem is proven in Sections [2] and O The condition that e > 1/n is not really restrictive 
because if e < 1/n then the theorem is trivial: G is itself a sparsifier with 0(l/e^) edges. The 
condition that the edge weights are integral is not restrictive either. If the edge weights are any 
positive real numbers then they can be approximated arbitrarily well by rational numbers, and 
these rationals can be scaled up to integers. This does not affect the conclusion of Theorem II .!( as 
it does not depend on the magnitude of u. 

The sampled weight of each copy of edge e is a binary random variable that takes value He/p 
with probability l/^e and zero otherwise. When k^ = ke, this random variable has the highest 
variance, and therefore the cuts of G' are least concentrated. So, at least intuitively, the theorem is 
hardest to prove when k^ = ke, and the result for smallest Ke values will follow as a corollary. This 
intuition is indeed correct, and we obtain several interesting corollaries of Theorem II. II bv invoking 
Algorithm [T] with different Kg values. Proofs are in Appendix iBl 

Corollary 1.2. Let Ke = ke- Then (jl.l|) holds and \E'\ = 0(n log n/e^) with high probability. 

Corollary 1.3. Let Ke = Ce- Then (jl.l|) holds and \E'\ = 0(n log n/e^) with high probability. 

Spielman and Srivastava [37J prove a related result: taking Ke = Ce, only 0(logn/e^) rounds of 
sampling suffice for (jl.ip to hold with constant probability. A simple modification of their proof 
implies Corollary 11.31 (See, e.g., Koutis et al. [30j.) It is unclear whether 0(logn/e^) rounds suffice 
for (II. ip to hold with high probability. 

Corollary 1.4. Let Ke = k'^. Then (jl.l|) holds and \E'\ = 0(n log n/e^) with high probability. 

Benczur and Karger [5j prove a stronger result: taking Kg = k'^, only 0(logn/e^) rounds of 
sampling suffice for (jl.ip to hold with high probability. 

An important aspect of our analysis is that we rely only on Chernoff bounds. In contrast, 
Spielman and Srivastava [37] use sophisticated concentration bounds for matrix-valued random 
variables. An advantage of Chernoff bounds is that they are very flexible and have been generalized 
in many ways. This flexibility enables us to prove the following result in Section HI 

Theorem 1.5. Let G = {V,E,u) be a graph with edge weights u : E ^ Z+, let e > 1/n, and 
let p := dlog n where d = 0(l/e^). With high probability, the graph G' produced by Algorithmic 
satisfies 

\u{6{S)) - w{d{S))\ < e-u{6{S)) \/S^V. 

Clearly \E'\ < p{n - 1) = 0(n log^ n/e^). 



Algorithm 2 An algorithm for producing a sparsifier of G by sampling random spanning trees. 
procedure SparsifyByTrees(G, u) 

input: A graph G = [V^E) with edge weights u : E ^ Z+ 
output: A graph G' = {V, E') with edge weights w : E ^f R+ 
For each e £ E, compute the conductance Cg 
For i = 1, . . . , p 

\> We refer to this as the i round of sampling 

Let T be a uniformly random spanning tree 

For each e £ T 

Add edge e to E' (if it does not already exist) and increase We by Cg/p 
Return G' and w 



Counting Small Cuts. An important ingredient in the proof of Theorem II. II is an extention of 
Karger's random contraction algorithm for computing global minimum cuts |221 [25] . We describe 
two variants of this algorithm which introduce the additional ideas of splitting off vertices and 
performing random walks. The main purpose of these variants is to prove generalizations of Karger's 
cut-counting theorem |22l I25j . which states that the number of cuts of size at most a times the 
minimum is less than n?". Our generalizations give "Steiner variants" of this theorem. Roughly 
speaking, we show that, amongst all cuts that separate a certain set of terminals, the number of 
size at most a times the minimum is less than n"^". 

Since our cut-counting result may be of independent interest, we state it formally now. 

Theorem 1.6. Let G = {V,E) be a graph and let B C E be arbitrary. Suppose that ke > K for 
every e G B. Then, for every real a > 1, 

\{ 6{S) DB : SQV A \6{S)\ < aK }\ < n^". 

We discuss this theorem in further detail in Section [3l For now, let us only mention that this 
theorem reduces to Karger's cut-counting theorem by setting B = E and setting K to the global 
minimum cut value. In this special case, it states that the number of a-minimum cuts is at most 

. In concurrent, independent work, Hariharan and Panigrahi |20] have also proven Theorem 1 1.61 
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1.3 Algorithms 

In this section we describe several algorithms to efficiently construct sparsifiers. To make Algo- 
rithm [1] into a complete algorithm, the most challenging step is to efficiently compute the k^ values. 
This can be done by computing estimates for either ke, Ce, or fcg. 

Edge Connectivity. The simplest approach is to estimate kg. Several methods for computing 
such estimates can be found in the work of Benczur and Karger [5]. In fact, these methods can be 
significantly simplified because they were originally designed for estimating fcg, which is more chal- 
lenging to estimate than kg- The following theorems describe how we can combine these methods 
with Algorithm [T] to efficiently construct sparsifiers. The resulting algorithms are simple enough 
for a real-world implementation, and they have theoretical value too: they can be used to improve 
the 0(m log n) running time of Benczur and Karger's sampling algorithm to nearly linear time 
(cf. Theorem ll.Op . 



These algorithmic results were also described in the earlier work Hariharan and Panigrahi [20]. 
In fact, their runtime bounds are slightly better, due to a different method of analysis. 

Theorem 1.7. Given a graph G and edge weights u : E ^ Z+ where maXeUe = poly(n), a 
sparsifier G' of size 0(n log'^n/e^) that satisfies (|1.1|) can be computed in 0{m -\- nlog^ n/ e^) time. 
If G is simple, the time complexity can be reduced to 0{m). 

Theorem 1.8. Given a graph G and edge weights u : E ^ Z+ where maxgWe = poly(n), a 
sparsifier G' of size 0(nlog^n/e^) that satisfies (jl.ip can be computed in 0(mlog^ n + nlog'^n/e^) 
time. 

Combining Theorems 11.71 and 11.81 with Bencziir and Karger's algorithm, we can obtain the 
following result. 

Theorem 1.9. Given a graph G and edge weights u : E ^ Z+ where maXgUe = poly(n), a 
sparsifier G' of size 0(n log n/e^) that satisfies (jl.ip can be computed in 0(m + n log n/e^) time. 

Proofs of these theorems can be found in Appendix iHl 

Eflfective Conductance. As described above, Spielman and Srivastava [37j also construct spar- 
sifiers by sampling according to the effective conductances. Moreover, they describe an algorithm 
to approximate the effective conductances in m log^ ' n time. This algorithm can be implemented 
more efficiently using the recent simplified method of Koutis, Miller and Peng [30]. Combining this 
with Algorithm [H we can construct a sparsifier with 0(nlog n/e^) edges in 0(m log n) time. 

Random Spanning Trees. Algorithm [2] can also be implemented efficiently, although we do 
not know how to do this in nearly linear time. The best known algorithms for sampling (approx- 
imately) uniform spanning trees run in 0(min |m-y/n, n^'^^^j) time [8l|26]. Combining this with 
the method described above for approximating the effective conductances gives an algorithm to 
compute sparsifiers with 0(n log n/e^) edges. The running time of this algorithm is dominated 
by the time needed to sample 0(log n/e^) random spanning trees. Although this algorithm is not 
as efficient as those listed above, the numerous special properties of random spanning trees might 
make it useful in other ways. 

1.4 Limits of sparsification 

In Corollaries [121 [13] and [131 the number of rounds of sampling p cannot be decreased to o(logn). 
To see this, consider a path of length n — with probability tending to 1 the sampled graph would 
be disconnected and hence not approximate the original graph. 

Sampling random spanning trees overcomes this obstacle since the graph is connected with 
probability 1. Indeed, Goyal, Rademacher and Vempala [19] show that, for any constant-degree 
graph, the unweighted union of just 2 spanning trees approximates every cut to within a factor 
O(logn). In Section [4. II we prove the following negative result for sampling random spanning trees. 

Lemma 1.10. For any constant c > 1, there is a graph such that Algorithmic requires P = il(logn) 
to approximate all cuts within a factor c with constant probability. 



2 Sparsifiers by independent sampling 

In this section we prove our main result, Theorem ll.il Perhaps the most natural approach would 
be to analyze the probability of poorly sampling each cut, then union bound over all cuts. In 
Appendix \C\ we explain why this simple approach fails, why Bencziir and Karger [5j proposed to 
decompose the graph and separately analyze the pieces, and why their approach does not suffice 
to prove Theorem ll.il 

Our analysis also involves partitioning the graph, but using a different approach. In fact, a very 
similar partitioning was used in an earlier proof of Bencziir and Karger [U §3.2] [3l §9.3.2]. We 
will partition the graph into subgraphs, each consisting of edges with roughly equal values of kg. 
Formally, we partition G into subgraphs with edge sets Ei,E2, ■ ■ ■, where 

Ei = {e£E : 2' <ke< 2*+^ } . 

We emphasize that Ei is defined using k^, not Ke- 

To prove that the weights of all cuts are nearly preserved (i.e., that Eq. (jl.ip holds), we will 
use a Chernoff bound to analyze the error contributed to each cut by each subgraph E^. A union 
bound allows us to analyze the probability of large deviation for all cuts simultaneously. As in 
previous work \24\ [5], the key to making this union bound succeed is to show that most cuts are 
very large, so their probability of deviation is very small. This is achieved by our cut-counting 
theorem. Theorem 11.61 

Prom this point onwards, to simplify our notation, we will no longer think of G as a weighted 
graph, but rather think of it as an unweighted multigraph which has Ug parallel copies of each edge 
e. The main benefit of this change is that the total weight of a cut can now be written |5(S')| 
instead of u{5{S)) since, for multigraphs, the notation \5{S)\ gives the total number of copies of all 
multiedges in S{S). We hope that this choice of notation makes the following proofs easier to read. 

The crucial definition for this paper is as follows. We say that a non-empty set of edges F <^ Ei 
is induced by a cut C if F = COEi. Any such set F is called a cut-induced subset of Ei. Note that 
F could be induced by different cuts C and C", i.e., C Ci Ei = F = C Ci Ei. Por a cut-induced set 
FQEi, define 

q{F) := mm{ \6{S)\ : S<^V A 6{S)nEi = F}. (2.1) 

So q{F) is the minimum sizqj of a cut that induces F. This is an important definition since the 
amount of error we can allow when sampling F is naturally constrained by the smallest cut which 
induces F. 

We also define a "normalized" form of q{F), which is a{F) := q{F)/2'^. Note that 2* is a lower 
bound on the size of any cut that intersects F, because every edge e G Ei has 2' < kg. So we can 
think of a{F) as a quantity that measures how close q{F) is to the minimum size of any cut that 
intersects Ei. Clearly a{F) > 1. 

Por any set F of edges, the random variable Xp denotes the total weight of all sampled copies 
of the edges in F, over all rounds of sampling. The main challenge in proving Theorem 11.11 is to 
prove concentration for all Xp where F is a cut-induced subset of some Ei. 



^We remind the reader that the notation |(5(5')| imphcitly involves the edge muhiplicities. So, thinking of G as a 
weighted graph, q{F) is really the minimuni weight of a cut that induces F. 



2.1 The bad events 

Let F (^ Eihe a cut-induced set. We now define three bad events which indicate that the edges in 
F were not sampled well. The first two events are: 

Af ■■ \Xf-\F\\ > e\F\ 

Bf: \XF-\F\\>'-fl 

mn 

The third event is not needed to analyze unweighted graphs (i.e., if every edge multiplicity is 
tie = 1); it is only needed to deal with arbitrary weights. The third event is 

\ I i^ I m n / 

where 5 : M — )• M is the function defined by 

g{x) = (l + x)ln(l + 2;)-x. (2.2) 

Note that its derivative is g'{x) = ln(l + x), so 5 is strictly monotonically increasing on M_|_, and 
hence invertible. Furthermore, g~^ is also strictly monotonically increasing on M_|_, by the inverse 
function theorem of calculus. 

We will show that, assuming that these events do not hold (for certain cut-induced sets), then 
the weights of all cuts are approximately preserved. To this end, we bound the probability of the 
bad events by the following three claims. These claims are proven by straightforward applications 
of Chernoff bounds in Appendix [Dj Recall that the parameter d in the statement of the claims 
satisfies d = 0(l/e^), as stated in Theorem ll.il 

Claim 2.1. Let F Q Ei be a cut-induced set with q{F) < \F\ Inn. Then 

Fr[AF] < 2n-'^°(^>'/^ 
Claim 2.2. Let F Q Ei be a cut-induced set with q{F) > \F\ Inn. Then 

Pt[Bf] < 2n-'^"(^)^'/^ 

Claim 2.3. For every cut-induced set F , 

Pr[Ci.]<n~'^"(^)^/^ 

Claim 2.4. By choosing d = 0(l/e^) sufficiently large, then with high probability, every cut- 
induced set F satisfies 

• if q{F) < \F\ Inn then Af does not hold; 

• if q{F) > \F\ Inn then Bf does not hold; and 

• Cf does not hold. 

The proof of Claim 12.41 given in Appendix [Dl is a straightforward modification of an argument 
of Karger [24] . The only difference with our proof is that we require a result which bounds the 
number of small cut-induced sets. Such a statement is given by Corollarv 12.51 which follows directly 
from Theorem 11.61 
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Corollary 2.5. For each i and any real number a > 1, the number of cut-induced sets F <^ Ei 
with a{F) < a is less than n^". 

Proof. Since every e £ Ei satisfies kg > 2*, we may apply Theorem 11.61 with B = Ei and K = 2^. 
This yields 



|{ 5{S) DEi : S<ZV A \6{S)\ < a2' }\ < 



n 



2a 



Now, by the definition of a{F) and q{F), for every cut-induced set F ^ Ei with a.{F) < a, there 
exists a cut 6{S) such that d{S) Ci Ei = F and \S{S)\ < a2*. This proves the desired statement. ■ 

2.2 All cuts are preserved 

In this section we prove that Eq. (jl.ip holds. Recall that the random variable Xc denotes total 
weight of all sampled edges in C. Our main lemma is 

Lemma 2.6. With high probability, every cut C satisfies \Xc — \C\\ = 0{e\C\). 

For unweighted graphs, the proof of this lemma is quite simple. For weighted graphs, we require 
the following three technical claims, which are proven in Appendix lEl 

Claim 2.7. Let F Q Ei be a cut-induced set. Then \F\ < n^2*. 

Claim 2.8. For any integer d >0, 

Y^ \CnEi\ < 2^~'^\C\. 

i<lg\C\-2lgn-d 

Claim 2.9. Define h -.R^R by 

n(x) 



ln(l + ^) ■ 
Then g~^{x) < h{x) for all x > 0. 

Proof (of Lemma l2.6p . We wish to prove that \Xc — \C\\ < 0(e|C|). We may assume that the 
conclusions of Claim 12.41 hold, since they hold with high probability. We use those facts to bound 
the error contribution from each cut-induced set Ci := C D Ei. 

To perform the analysis, we partition the Cj's into three classes, according to which bad event 
{Aci, Bci or Cci) will be used to analyze the error. This partitioning depends on the threshold 



The sets of indices are 



Ti = {i 
T2 = {i 
Is = {i 



t := lg|C|-41gn-lg(l/e). 

\Ci\>q{Ci)/\nn} 

0< \Ci\ <q{Ci)/lnn A t<i<lg|C| } 

0< \Ci\ < q{Ci)/lnn A <i < t}. 



We remark that 



lg|C| -t < 51gn (2.3) 

since we assume that e > 1/n. 

To analyze the error Xc — \C\, we expand it as a sum over cut-induced sets. 

xc-\c\ = y(Xc^-\a\) + y(xc^-\a\) + V(Xc,-iqi) (2.4) 



>:(^c..- 


-\a\) + }]{xc,- 


-\Q\) + >;(xc, 



Unweighted graphs. For unweighted graphs, the analysis is simple. Since any cut satisfies 
\C\ < V? ^ we have Ig |C| < 2 Ig n and so X3 = 0. We have assumed that the conclusions of Claim WM 
hold, so the events Ad and i3c. do not occur (under the stated conditions on q(Ci)). Therefore 

\xc-\c\\ < Y.\Xd-\a\\ + Y,\xa-\a\\ 

- Z^ I *i Z^ Inn 

ieXi ieX2 

< e\C\ + '-^^^ (since q{a) < \C\) 

Inn 

= 0(e|C|) (by Eq. ^3^). (2.5) 

This completes the proof for the unweighted case. 

Weighted graphs. For weighted graphs the analysis is slightly more complicated because the 
number of cut-induced sets Q that contribute error may be much larger than Ign, because X3 may 
be non-empty. To show that the total error is still small, we will need to use the events Cd ■ 

Consider Eq. ()2.4p again. The first two sums were analyzed in Eq. ()2.5p . so it suffices to analyze 
the third sum. First we prove a lower bound on this sum: 

Y,{\Q\-Xd) < 5]|Q| = 0{e\C\), 
ieXs ieXs 

by Claim [231 and the definition of t. 

Now we prove an upper bound. By Claim [2^ we may assume that the events Cd do not hold. 

We use the fact that g~^ is monotonically increasing and that \C\ > q{Ci). This yields 






Li m n 



< y^HT^^]\Ci\ (by Claim ESD 



JGX3 

\\C\\ ^ 1 



I Inn ; -^ 



'eX3 log (1 + ^^) 



e\C\\ ^ 1 



< o P^^ y , , , by ClaimEB 

\ ^ / »eX3 ig U^ig(„)2» 



I Incrr) ; Z--' 



logn / -=— 1 lo- f ___i^il__ 



e|C|\ ^ 1 



^oSi E 



logn/ ''—I „■ I i„ / /sICI 

^ t-2GX3 « + ig (^„^lg(„)2i 
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< Oh^ >: -^-- (2.6) 



The last inequality holds since 



e\C\ e\C\ ^ n2 



n2lg(n)2* „2ig(„)2ig|C|-4ig'^-ig(iA) - \gn' 

The sum in Eq. (j2.6p is a subseries of a harmonic series with at most n^ terms (since there are at 
most (2) distinct Kg values) so the value of this sum is O(logn). Thus we have shown that the 
third sum in Eq. ()2.4p is at most 0(e|C|). ■ 

2.3 The size of the sparsifier 

To complete the proof of Theorem ll.il it remains to show that G' does not have too many edges, 
i.e., Eq. ([O]) holds. We have 

E [ I £"' I ] = y Pr [ at least one copy of e is sampled in one of the rounds ] 

edge edE 
edge e£i? 

< p E -' 

He 
edge eS-E 

so the right-hand side of Eq. (jl.2p is at least 2E [ \E'\ ] + O(logn). By a simple Chernoff bound, it 
follows that Eq. (11.20 holds with high probability. 



3 The cut-counting theorem 

In this section, we prove Theorem 11.61 which is our generalization of Karger's cut-counting the- 
orem \12\ I25j . The proof of Karger's theorem is based on his randomized contraction algorithm 
for finding a global minimum cut of a graph. Roughly speaking he shows that, for any small 
cut-induced set, it has non-negligible probability of being output by the algorithm, and hence the 
number of small cut-induced sets must be small. We will prove our generalized cut-counting theo- 
rem by analyzing a variant of the contraction algorithm which incorporates the additional idea of 
splitting off vertices. 

The formal statement of our theorem is: 

Theorem 11.61 Let G = {V, E) he a graph and let B (1 E be arbitrary. Suppose that kf, > K for 
every e £ B. Then, for every real a > 1, 

\{ 5{S) DB : SQV A \6{S)\ < aK }\ < n^". 

This theorem becomes easier to understand by restating it using the terminology of Section [2] 
(cf. Corollary [23]). A cut-induced subset of B is precisely a set of the form 6{S)r]B, so the theorem 
is counting cut-induced sets satisfying some condition. This condition is: for a cut-induced set F, 
there must exist S (^ V with \S{S)\ < aK and F = 5{S) n B. This condition is equivalent to 
q{F) < aK, where q is the function defined in Eq. (j2.ip . So the conclusion of Theorem 11.61 can be 
restated as 

|{ F : F is a cut-induced subset of B A q{F) < aK }| < n^". 

10 



Comparison to Karger's theorem. For the sake of comparison, Karger's theorem is: 

Theorem 3.1 (Karger |221I25| ). Let G = (V^E) he a connected graph and let B C E be arbitrary. 
Suppose that the value of the global minimum cut is at least K . Then, for every real a > 1, 

|{ C : C is a cut in G A \C\ < aK }\ < n^". 

Our theorem improves on Karger's theorem in two ways. First of all, we count cut-induced sets 
instead of cuts. This is clearly more general and, as we mentioned before, it is useful because it 
avoids overcounting cut-induced sets that are shared by many cuts. Secondly, we want to bound 
the number of "small" cut-induced sets in B. The bounds given by both theorem are n^", where 
a measures how small a cut or a cut-induced set is. However in our cut-counting lemma, a is 
relative to K, the size of a smallest cut that intersects with B, not relative to the size of a global 
minimum cut as in Karger's theorem. This is an improvement since the global minimum cuts may 
not intersect B at all, so the global minimum cut value could be much smaller than K. 

For concreteness, consider the example in Figure [H which appears in Appendix [Aj Suppose we 
want to bound the number of cuts that intersect with Eign- (Here E\gn consists of the single edge 
st.) Note that all such cuts have size n — 1. However the global minimum cuts all have size 2, and 
they do not intersect with Eign- From Theorem 13.11 we see that there are at most n"~^ cuts of 
size at most n — 1 that intersect Eign- In contrast. Theorem 11.61 states that there are at most n^ 
cut-induced subsets of Eig „ that are induced by cuts of size at most n — 1 . 

A ^veaker theorem based on effective conductance. We have also proven a weaker version 
of Theorem 11.61 which does not suffice to prove Theorem 11.11 but does suffice to prove Corollary 11.31 
This weaker version is stated as Theorem l3.21 it is weaker because the hypothesis Ce > K is stronger 
than the hypothesis ke > K, by Claim [ATTl 

Theorem 3.2. Let G = (y,E) be a graph and let B Q E be arbitrary. Suppose that Ce > K for 
every e £ B. Then, for every real a > 1, 

\{F : F is a cut-induced subset of S A q{F) < aK }\ < r?'^ . 

Although this theorem is weaker than Theorem 11.61 we feel that it is worth including in this 
paper because its proof is based on analysis of random walks that may be of independent interest. 
The proof is in Appendix iGl 

3.1 The generalized contraction algorithm 

Theorem 11.61 follows immediately from Theorem 13.31 which is the analysis of our generalized con- 
traction algorithm (Algorithm [3]) . Henceforth, we will use the following terminology. The edges 
in B are called black. Also, a cut is black if it contains a black edge, and a vertex is black if it is 
incident to a black edge. An edge, vertex or cut is white if it is not black. 

Theorem 3.3. For any cut-induced set F <^ B with q{F) < aK , Algorithm outputs F with 
probability at least n~^". 

Algorithm [3] is essentially the same as Karger's contraction algorithm, except that it maintains 
the invariant that G has no white vertex by splitting- off. For a pair of edges uv and vw, splitting-off 
uv and vw is the operation that removes uv and vw then adds a new edge uw. This splitting-off 
operation is admissible if it does not decrease the (local) edge connectivity between any pair of 
vertices s and t, except of course when one of those vertices is v. Splitting-off has many applications 
in solving connectivity problems because of the following theorem. 
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Algorithm 3 An algorithm for finding a small cut-induced set by splitting off white vertices. 
procedure Contract (G, B, a) 

input: A graph G = (y,E), a set B (1 E, and an approximation factor a 
output: A cut-induced subset of B 
While there are more than [2a] vertices remaining 
While there exists a white vertex v 

Perform admissible splitting-off at v until v becomes an isolated vertex 
Remove v 
Pick an edge e uniformly at random 
Contract e and remove any self loops 
Pick a non-empty proper subset SofV uniformly at random and output the black edges with 
exactly one endpoint in S 

Theorem 3.4 (Mader [33]). Let G = (V^E) he a connected graph and v £ V be a vertex. If v has 
degree ^ 3 and v is not incident to any cut edge, then there is a pair of edges uv and vw such that 
the splitting-off of uv and vw is admissible. 

Since Algorithm [3] needs to perform admissible splitting-off, we must ensure that the hypotheses 
of Mader's theorem are satisfied. This can be accomplished by the simple trick of duplicating every 
edge, which ensures that G is Eulerian and its components are 2-edge-connected. Note that these 
conditions are preserved under all modifications to the graph performed by Algorithm [3l namely 
contraction, splitting-off and removal of self loops. 

To prove Theorem 13.31 we fix a cut-induced set F with q{F) < aK. We will show that, with 
good probability, the algorithm maintains the following invariants. 

(II): F is a cut-induced set in the remaining graph, 

(12): q{E) < aK (where q{F) now minimizes over cuts in the remaining graph), and 

(13): every remaining black edge e satisfies ke > K. 

The only modifications to the graph made by Algorithm [3] are splitting-off operations, contrac- 
tion of edges, and removal of self-loops. Clearly removing self-loops does not affect the invariants. 

Claim 3.5. The admissible splitting-off operations at v do not affect the invariants. 

Proof. For (II), note that splitting-off affects only white edges, and all edges in F are black. 
For (12), note that splitting-off only decreases the size of any cut. For (13), the edge connectivity 
between any two black vertices is unaffected since the splitting-off is admissible and v is white. ■ 

Claim 3.6. Let the number of remaining vertices be r. Assuming that the invariants hold, they 
will continue to hold after the contraction operation with probability at least 1 — 2a /r. 

Proof. For (13), note that any black cut which exists after the contraction also existed before the 
contraction, so the edge connectivity between any two black vertices cannot decrease. 

Now, with respect the graph before the contraction, let C be a minimum cardinality cut that 
induces F, i.e., \G\ = q{F). We claim that Pr[e G C] > 2a/r, where the probability is over the 
random choice of e to be contracted. To see this, note that every remaining vertex u is black, so 
the cut 5{{u}) is a black cut. By invariant (13) we have 6{{u}) > K, so the number of remaining 



12 



edges is at least Kr 12. Since e is picked uniformly at random, 

by (12). Let us assume that e ^ C. Then F is still induced by C after contracting e, so (II) is 
preserved. Furthermore, (12) is preserved since \C\ < aK. ■ 

The following claim completes the proof of Theorem 13.31 We relegate its proof to Appendix [F] 
as it is the same argument used to prove Karger's theorem |22tl25j. (See also Karger |24^ App. A], 
where a slightly more general result is proven.) 

Claim 3.7. The probability that Algorithmic outputs F is at least n~^". 

3.2 Remarks on cactus representations 

A special case of Theorem 13. II is that any connected graph G = {V, E) has at most v? (non-trivial) 
minimum cuts. (In fact, the theorem actually proves a bound of (2), which is tight.) The same 
fact is implied by a much earlier result of Dinic, Karzanov and Lomonosov [TO], which states that 
the minimum cuts have a cactus representation. Fleiner and Frank [15] give a recent exposition of 
this result. 

Dinit^jand Vainshtein |1HI12J generalized this result as follows. (See also Fleiner and Jordan [16]. 
Let [/ C y be a subset of vertices with |C/| > 2. A cut C = 5{S) is called a U-cut if the partition 
{[/ n 5, [/ \ 5"} of [/ that it induces has both parts non-empty. A U-cvX C is called minimal if |C| 
is minimal amongst all [/-cuts. Two minimal [/-cuts are called equivalent if they induce the same 
partition of U . Dinitz and Vainshtein showed that the equivalence classes of minimal [/-cuts have 
a cactus representation. In particular, there are at most n^ equivalence classes of minimal [/-cuts. 

We now explain how the latter result also follows from Theorem 11.61 Let K be the minimum 
cardinality of a [/-cut. We add dummy edges of weight e/n^ between all pairs of [/-vertices and let 
B be the set of dummy edges. Then every dummy edge e has k^ > K and every minimal [/-cut 
has weight at most (1 -|- €)K. By Theorem II. 6^ the number of cut-induced sets induced by cuts 
of size at most (1 -|- e)K is at most n'^^^'^'^' . Any two equivalent minimal [/-cuts induce the same 
cut-induced subset of B, so the number of equivalence classes is at most n^(^"^^\ Taking e — )• 
proves that there are at most n? equivalence classes of minimal [/-cuts. 

4 Sparsifiers by uniform random spanning trees 

In this section we describe an alternative approach to constructing a graph sparsifier. Instead 
of sampling edges independently at random, as was done in Section [21 we will sample edges by 
picking random spanning trees. The analysis of this sampling proves Theorem 11.51 The proof is a 
small modification of the proof in Section [21 with some differences to handle the dependence in the 
sampled edges. The following two lemmas explain why sampling random spanning trees is similar 
to sampling according to effective conductances. 

Let us introduce some notation. For an edge st G E, we denote by rgt the effective resistance 
between s and t. This is the inverse of the effective conductance Cst- 



E. Dinic and Y. Dinitz are two different transliterations of the same person's name. 
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Lemma 4.1. Let G = {V,E) be an unweighted simple graph, and let T be a spanning tree in G 
chosen uniformly at random. Let ei, . . . ,ek £ E be distinct edges. Then 

Pr[ei,...,efeGT] < Pr[ei G T] • • • Pr[efc G T] (4.1) 

Pr[ei,...,efe0r] < Pr[ei T] • • • Pr[efc T] (4.2) 

Proof. In the case k = 2, Equation 14.11 was known to Brooks et al. [T, Equation (2.34)]. See 
also Lyons and Peres [321 Exercise 4.3]. For general k, this is a consequence of Theorem 4.5 in 
Lyons and Peres [32], which is a result of Feder and Mihail |14] . See also Goyal, Rademacher and 
Vempala ^9\ Section 3]. ■ 

One useful consequence of Lemma 14.11 is that concentration inequalities can be proven for the 
number of edges in T that lie in any given subset. The concentration is due to the following 
theorem: 

Theorem 4.2. Let ai,a2, ■ ■ ■ ,ak be reals in [0,1], and let Xi,...Xk be {0,l}-valued random 
variables. Suppose that 

Pr[A,e/X, = 1] < IIPt[X, = I] V/ C [k] 
iei 

Pr[A,6/X, = 0] < llPr[X, = 0] V/ C [k]. 

Suppose fii < E['^-aiXi] < fi2- Then 

Pv[^aiXi<{l-5)fii] < e-^^^'/2 



PrEa.X.> (1 + 5)^2] < {^^^^) 



f^2 



Proof. See Gandhi et al. [Ml Theorem 3.1]. 

We will also use the following corollary. 
Corollary 4.3. Assume the same hypotheses as Theorem \4-.S\ Let fi = E[^-aiXi]. Then 

Pr[| Y^ OiXi - fi\> 5n] < 2e~ 



2(1+5/3) 



Proof. Follows from Theorem 14.21 and basic calculus. See also McDiarmid [341 Theorem 2.3]. ■ 

Now consider the approach of Algorithm [2] for constructing a sparsifier. In each round of 
sampling, instead of picking edges independently, we pick a uniformly random spanning tree. Every 
edge e in the tree is assigned weight Cg. This sampling is repeated for p rounds, and the sparsifier 
is the sum of these weighted trees. 

By Lemma [B. 21 the probability of sampling any particular edge is the same as when sampling by 
effective conductances, as was done in Corollary 11.31 Furthermore, the same analysis as Section [2] 
shows that this sampling method also produces a sparsifier — the only change to the analysis is 
that all uses of Chernoff bounds (namely, in Claims 12.11 12.21 and 12. 3p can be replaced with the 
concentration bounds in Theorem 14.21 and Corollary 14.31 This completes the proof of Theorem 11.51 
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4.1 Lower bound on number of trees 

In this section, we consider the tradeoff between the number of trees (i.e., the value p) and the 
quahty of sparsification in Theorem 11.51 We prove a lower bound on the number of trees necessary 
to produce a sparsifier with a given approximation factor. 

Proof (of Lemma ll.lOp . Let G be a graph defined as follows. Its vertices are {ui, . . . , Wn} U 

{ui, . . . , Vn+i}- For every i = 1, . . . , n, add k parallel edges ViVl_^^^ . . . , i^iWj+i, and a single length- 

(?) 
two path Vi-Ui-Vi-\.i. The edges fifi+i ^^^ called heavy, and the edges ViUi and Uifj+i are called 

light. Note that the heavy edges each have effective conductance exactly {2k + l)/2. The light 

edges each have effective conductance exactly {2k + l)/{k + 1) < 2. 

A uniform random spanning tree in this graph can be constructed by repeating the following 
experiment independently for each i = 1, . . . ,n. With probability 2k/ {2k + 1), add a uniformly 
selected heavy edge ^i^)+i to the tree, and add a uniformly selected light edge ViUi or UiVi+i to 
the tree. In this case we say that the tree is "heavy in position i" . Otherwise, with probability 
1/(2A; + 1), add both light edges ViUi and UjUj+i to the tree but no heavy edges. In this case we 
say that the tree is "light in position i" . 

Our sampling procedure produces a sparsifier that is the union of p trees, where every edge 
e in the sparsifier is assigned weight Ce/p- Suppose there is an i such that all sampled trees 
are light in position i. Then the cut defined by vertices {vi,ui,V2,U2, ■ ■ ■ ,Vi} has weight exactly 
{2k + l)/{k + 1) < 2 in the sparsifier, whereas the true value of the cut is A; + 1. 

The probability that at least one tree is heavy in position i is 1 — {2k + 1)^^. The probability 
that there exists an i such that every tree is light in position i is 

p := 1 - {1 - {2k + 1)-Pr 

Suppose p = lnn/ln{2k + 1). Then lim^-^ooP = 1 — 1/e. So with constant probability, there is an 
i such that every tree is light in position i, and so the sparsifier does not approximate the original 
graph better than a factor -^. ■ 



References 

[1] Sanjeev Arora and Satyen Kale. A combinatorial, primal-dual approach to semidefinite pro- 
grams. In Proceedings of the 39th Annual ACM Symposium on Theory of Computing (STOC), 
pages 227-236, 2007. 

[2] Joshua D. Batson, Daniel A. Spielman, and Nikhil Srivastava. Twice-Ramanujan sparsifiers. 
In Proceedings of the 41st Annual ACM Symposium on Theory of Computing, pages 255-262, 
2009. 

[3] Andras A. Bencziir. Cut structures and randomized algorithms in edge- connectivity problems. 
PhD thesis, Massachusetts Institute of Technology, 1997. 

[4] Andras A. Benczur and David R. Karger. Approximate s-t min-cuts in 0{n'^) time. In Pro- 
ceedings of the 28th Annual ACM Symposium on Theory of Computing (STOC), 1996. 

[5] Andras A. Benczur and David R. Karger. Randomized approximation schemes for cuts and 
flows in capacitated graphs, 2002. arXiv:cs/0207078. 

[6] Bela Bollobas. Modern Graph Theory. Springer, 1998. 

15 



[7] R. L. Brooks, C. A. B. Smith, A. H. Stone, and W. T. Tutte. The dissection of rectangles into 
squares. Duke Math. J., 7:312-340, 1940. 

[8] Charles J. Colbourn, Wendy J. Myrvold, and Eugene Neufeld. Two algorithms for unranking 
arborescences. J. Algorithms, 20(2):268-281, 1996. 

[9] Luc Devroye. Non-Uniform Random Variate Generation. Springer, 1986. Available at 
http : //eg . scs . carleton . ca/~luc/rnbookindex . html. 

[10] E. A. Dinic, A. V. Karzanov, and M. V. Lomonosov. The structure of a system of minimal edge 
cuts of a graph. In A. A. Fridman, editor. Studies in Discrete Optimization, pages 290-306. 
Izdatel'stvo "Nauka", 1976. 

[11] Yefim Dinitz and Alek Vainshtein. The connectivity carcass of a vertex subset in a graph and 
its incremental maintenance. In Proceedings of the 26th Annual ACM Symposium on Theory 
of Computing (STOC), pages 716-725, 1994. 

[12] Yefim Dinitz and Alek Vainshtein. The general structure of edge-connectivity of a vertex 
subset in a graph and its incremental maintenance, odd case. SIAM Journal on Computing, 
30(3):753-808, 2000. 

[13] Peter G. Doyle and J. Laurie Snell. Random walks and electric networks. 

arXiv:math.PR/0001057. 

[14] T. Feder and M. Mihail. Balanced matroids. In Proceedings of the 24th Annual ACM Sympo- 
sium on Theory of Computing (STOC), 1992. 

[15] Tamas Fleiner and Andras Frank. A quick proof for the cactus representation of mincuts. 
EGRES Quick-Proof No. 2009-03, Egervary Research Group on Combinatorial Optimization, 
Eotvos Lorand University, 2009. 

[16] Tamas Fleiner and Tibor Jordan. Coverings and structure of crossing families. Mathematical 
Programming, 84(3):505-518, 1999. 

[17] R. M. Foster. The average impedance of an electric network. In Contributions to Applied 
Mechanics, pages 333-340. Edwards Bros., 1949. 

[18] R. Gandhi, S. Khuller, S. Parthasarathy, and A. Srinivasan. Dependent Rounding and its 
Applications to Approximation Algorithms. Journal of the ACM, 53:324-360, 2006. 

[19] Navin Goyal, Luis Rademacher, and Santosh Vempala. Expanders via random spanning 
trees. In Proceedings of the Twentieth Annual ACM-SIAM Symposium on Discrete Algorithms 
(SODA), pages 576-585, 2009. 

[20] Ramesh Hariharan and Debmalya Panigrahi. A general framework for graph sparsification, 
April 2010. http://arxiv.org/abs/1004.4080. 

[21] Toshihide Ibaraki. Computing edge-connectivity in multiple and capacitated graphs. In SIGAL 
'90: Proceedings of the international symposium on Algorithms, pages 12-20, New York, NY, 
USA, 1990. Springer- Verlag New York, Inc. 

[22] David R. Karger. Global min-cuts in RNC, and other ramifications of a simple min-cut al- 
gorithm. In Proceedings of the 4th Annual ACM/SIAM Symposium on Discrete Algorithms 
(SODA), pages 21-30, 1993. 

16 



[23] David R. Karger. Random sampling in cut, flow, and network design problems. In Proceedings 
of the 26th Annual ACM Symposium on Theory of Computing (STOC), 1994. 

[24] David R. Karger. Random sampling in cut, flow, and network design problems. Mathematics 
of Operations Research, 24(2):383-413, May 1999. 

[25] David R. Karger and Clifi^ord Stein. A new approach to the minimum cut problem. Journal 
of the ACM, 43(4) :60 1-640, July 1996. 

[26] Jonathan A. Kelner and Aleksander Madry. Faster generation of random spanning trees. In 
Proceedings of the 50th Annual IEEE Symposium on Foundations of Computer Science, pages 
13-21, 2009. 

[27] Rohit Khandekar, Satish Rao, and Umesh V. Vazirani. Graph partitioning using single com- 
modity flows. Journal of the ACM, 56(4), 2009. 

[28] G. Kirchhoff. Uber die Auflosung der Gleichungen, auf welche man bei der Untersuchung der 
linearen Vertheilung galvanischer Strome gefiihrt wird. Ann. Phys. und Chem., 72:497-508, 
1847. 

[29] D.E. Knuth and A.C. Yao. The complexity of nonuniform random number generation. In J. E. 
Traub, editor. Algorithms and Complexity, pages 357-428. Academic Press, 1976. 

[30] loannis Koutis, Gary L. Miller, and Richard Peng. Approaching optimality for solving SDD 
systems, 2010. arXiv: 1003.2958. 

[31] Laszlo Lovasz. Random walks on graphs: A survey. In D. Miklos, V. T. Sos, and T. Szonyi, 
editors, Combinatorics, Paul Erdos is Eighty, Volume 2, pages 353-398. Janos Bolyai Mathe- 
matical Society, 1996. 

[32] Russell Lyons and Yuval Peres. Probability on Trees and Networks. Cambridge University 
Press, 2010. In preparation. Current version available at http://inypage.iu.edu/~rdlyons/. 

[33] Wolfgang Mader. A reduction method for edge-connectivity in graphs. Ann. Discrete Math., 
3:145-164, 1978. 

[34] Colin McDiarmid. Concentration. In M. Habib, C. McDiarmid, J. Ramirez-Alfonsin, and 
B. Reed, editors, Probabilistic Methods for Algorithmic Discrete Mathematics, pages 1-46. 
Springer, 1998. 

[35] Hiroshi Nagamochi and Toshihide Ibaraki. A linear-time algorithm for finding a sparse k- 
connected spanning subgraph of a k-connected graph. Algorithmica, 7(5&:6):583-596, 1992. 

[36] Daniel A. Spielman. Algorithms, graph theory, and linear equations. In Proceedings of the 
International Congress of Mathematicians, 2010. 

[37] Daniel A. Spielman and Nikhil Srivastava. Graph sparsification by effective resistances. In 
Proceedings of the 40th Annual ACM Symposium on Theory of Computing (STOC), pages 
563-568, 2008. 

[38] Daniel A. Spielman and Shang-Hua Teng. Nearly-linear time algorithms for graph 
partitioning, graph sparsification, and solving linear systems. In Proceedings of the 
36th Annual ACM Symposium on Theory of Computing (STOC), pages 81-90, 2004. 
http : //arxiv . org/abs/cs . DS/0310051. 

17 



Figure 1: Example showing that conductance can be 0(n) times larger than strength 
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Figure 2: Example showing that strength can be 0(n) times larger than conductance 
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A Discussion of kgt, Cgt and k'^^ 

As mentioned in the introduction, the three quantities of an edge st that we consider (edge con- 
nectivity, effective conductance and edge strength) all roughly measure the connectivity between 
s and t. However their values can differ significantly. In this section, we illustrate this with some 
examples. 

Consider a graph which consists of exactly one edge st. To increase kgt by A;, we can simply add 
k edge disjoint paths between s and t. In the following examples, we can see that no matter how 
large k is, it is possible that k'^f- or Cgt increases only by one while the other increases by 0,(k). 

• In Figure [H s and t are connected by an edge st and n — 2 paths of length 2. Clearly 
kst = n — 1, Cst = ^{n — 2) + 1 = n/2. But k'^f. = 2 as every induced subgraph with at least 
two vertices is at most 2 edge connected. 



In Figure [21 s and t are connected by an edge st and a path of length n — 1 which consists of 

'st = n but Cst = ^ 



edges of weight n — 1. The graph is n-edge-connected so kgt = k'^^ = n but Cgt = ^^-A + 1 = 2. 



Although Cst and k'^^ are incomparable, they are upper bounded by kgf 
Claim A.l. For any edge st E E, kgt > maxjcst, k'^^}. 

Proof. It is immediate from the definition of edge strength that k'^^ < kst, so we focus on the 
effective conductance. Since the connectivity between s and t is kgt, there is a cut of size k^t 
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separating s and t. Contracting both sides of the cut, we get two new vertices s' and t' . By 
Rayleigh monotonicity [T3], c^'f is at least Cst- Clearly Cg't' = kst, so the proof is complete. ■ 



B Corollaries of Theorem 11.11 

First we show that our corollaries satisfy the hypotheses of Theorem ll.il By Claim |A?H Corollaries 
11.21 11.31 and 11.41 all have Hg < kg, so Theorem 11.11 is applicable. 

It remains to analyze \E'\, the number of sampled edges. For Corollaries 11.21 and 11.41 we use a 
property of edge strength proved by Bencziir and Karger [5i Lemma 2.7]. 

Lemma B.l. In a multigraph with edge strengths k'^, we have 

Y^l/k'g < n-\. 

eG-E 

Here the sum is over all copies of the multiedges. 
Thus, for Corollaries 11.21 and 11.41 we have 

E[|i?'|] = pY,Pe = pY.^I''- ^ pY.^IK < P{n-l) = 0{n\og^n/e^). 

e<=E e&E eeE 

Finally, we must bound the size of E' in Corollary 11.31 We require the following lemma. 

Lemma B.2. Let G = {V, E) be a multigraph, and let T be a spanning tree in G chosen uniformly 
at random. Then, for any copy of an edge e € E, Pr[e S T] = l/cg. 

Proof. See Kirchhoff [28], Brooks et al. [Zl pp. 318], Lovasz [HI Theorem 4.1(i) and Corollary 4.2] 
and Lyons and Peres [321 Corollary 4.4]. ■ 

This immediately implies that 

^1/ce = ^Pr[eeT] = E[\E{T)\]=n-l. 

eGE eeE 

This fact is known as Foster's theorem, and it is independently due to Foster [17] and Tetali [40J . 
Thus, 

E[\E'\] = pY,Pe = pY,l/ce = p(n-l) = 0(n log^n/e^). 

eeE eeE 

C Motivation for Partitioning Edges 

The natural first approach to proving Theorem 11.11 would be to bound the probability of large 
deviation for each cut and then union bound over all cuts. This approach is not feasible, as can be 
illustrated using the example in Figure El 

In this graph, s and t are connected by k parallel edges and k paths of length 2. Recall that 
in our sampling scheme each copy of st is sampled with probability 1/2A; for p = 0(log n) rounds 
and is assigned a weight of 2k/ p if sampled. Each edge other than st is sampled with probability 
1/2 for p rounds and is assigned a weight of 2/p if sampled. Consider a set U ^V — t that contains 
s. Then |5(f/)| is at most 2k. Suppose we want to bound the probability that the sampled weight 
w{6{U)) exceeds 4fc. For this to happen, at most 2p copies of st can be included in the sparsifier. 
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Figure 3: The cut-induced set consisting of st is overcounted 2" times if we simply union bound 
over all cuts. 

By a Chernoff bound, this many edges are included with probability at most e"^'' = e '~ ^^ "^ 
However, there are Q(2") such C/'s, which is too many for such a union bound to work. 

The reason this union bound fails is that the event "more than 2p copies of st are included" is 
overcounted 0(2") times, once for each U. However since all (5(C/)'s share the same k copies of st, 
we actually only need to analyze this event once. 

Benczur and Karger [5] accomplished this by decomposing the graph. Assume that the edges 
E = {ei, . . . , em} are sorted by increasing edge strengths. Each Gj contains all edges e, with i > j. 
Then G can be viewed as the sum of Gj's, with each Gj scaled by fcg. — Ajg _^. An important 
property of this decomposition is that if Cj is in Gj then the strength of e, in Gj is the same as its 
in the original graph G. This is because the feg. -strong component H in G that contains Cj must 
also be present in Gj, as all edges in H have strengths at least k'^.. 

Therefore, even though edges in Gj have small sampling probabilities (at most 1/k'^.), the 
expected number of sampled edges in every cut is at least Q{logn/e^), since the min cut of Gj 
is large (at least k'^.). Thus Karger 's graph skeleton analysis is applicable to sampling in Gj. 
Roughly speaking, in order to use the Chernoff bound to obtain a constant factor approximation 
in the number of sampled edges in a cut with a failure probability of n~^^', the expected number 
of sampled edges in the cut needs to be Q{logn). 

To prove Theorem I l.H we could attempt to use the same decomposition to analyze our sampling 
scheme where edge connectivity is used instead of strength. The problem is that in general edge 
connectivity is not preserved under such decomposition. To see this, consider the example in 
Figure [TJ Observe that the subgraph induced by those edges with connectivities at least n — 1 
consists of only one edge st, so this subgraph has min cut value 1. The expected number of copies 
of st in the sparsifier is 0(log n/n), so we cannot expect to say that sampling preserves every cut 
of this subgraph to within lie. 

D Proofs for Section 12.11 

In this section we prove Claim [2711 Claim [272] and Claim [2731 We require the following three versions 
of the Chernoff bound. For the case U = 1, these can be found in the survey of McDiarmid [34j : 
the case of larger U reduces to that case by scaling. 

Theorem D.l. Let Xi, . . . ,Xk be independent random variables with values in [0,1]. Let 
ai,...,ak be scalars in [0,U]. Let X be a weighted sum of Bernoulli trials defined by X = 
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Yli=i o^i^i! o,nd /ei /i = E [X]. Then for any 5 > Q, we have 



Pr[|X-/i| > 5/i] < 2exp 



2{l + 5/^)U ' 
Corollary D.2. Let X and ^ he as in Theorem \D.l[ Then for any < 5 < 1, we have 

Pi[\X-fi\ ><5/x] <2e-^'^/3^. 
Theorem D.3. Let X and /x be as in Theorem \D. 11 Then for any 5 > 0, we have 

where g{6) = {1 + 5) ln(l + 5) — 6 is the function defined in Eq. (j2.2p . 
Claim r2. 11 Let F C Ei he a cut-induced set with q{E) < \F\ Inn. Then 

Pt[Af] < 2n~'^"(^)^'/^ 



Proof. Let U = — — and /x = E [Xp] = \F\. By the definition of our samphng process, Xp is a 
weighted sum of Bernouhi trials where each weight is less than U. Thus 

Fv[Af] = Pt[\Xf-\E\\ >e\E\] 

< 2exp ^-^ (by Corollary [EU 

< 2exp (by our assumption on q{F)) 



< 2exp 



31n(n)2*+i 
e'^a{F)dlnn 



6 / 

This concludes the proof. 

Claim [2.21 Let F C Ei be a cut-induced set with q{F) > \F\ Inn. Then 

Pr [Bf] < 2n-'^"(^)^'/^ 



Proof. Let U = — — , /x = E [Xf] = \F\ and 6 = ^ ?^|L . Then Xf is a weighted sum of Bernoulli 
trials, and U is an upper bound on the weights. Note that 6 > e and 1 > e. Thus 

6^ 6e 6e 

1 + 6/3 - l + e/3 - Y' 

Also, 



Sfi eq{F)p ea{F)dlnn /ni\ 

T7 " ln(n)2^+i " 2 ' ^ ' ^ 
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Thus 



PrlBp] = Pr 



\Xp-\F\\ > 



eq{F) 



Inn 

Pi[\Xf- \F\\ >6\F\] 

5V ^ 



< 2exp 



2{l + 6/3)U 



^'^^A-'-i 



< 2exp 



e'^a{F)dlnn 



(by Theorem ID. ip 



(by Eq. dEI])) 



This concludes the proof. 

Claim 12. 3L For every cut-induced set F , 



Proof. Let [/ = ^^ and /i = E [Xi. ] = |F|. Then 



Pt[Cf] = Pr 



Xf -\F\>g 

-2 



-1 



l{F)\ 



< 



exp 



< exp 



< 



exp 



eHF) \ \F\ 
|F|lnn/ U 

e'q{F)p \ 
ln(n)2*+i^ 

e'^a{F)dlnn 



\F\lnnJ 

(by Theorem ID. 3p 



This completes the proof. ■ 

Claim 12.41 By choosing d = 0(1/6^) sufficiently large, then with high probability, every cut- 
induced set F satisfies 

• if q{F) < \F\ Inn then Af does not hold; 

• if q{F) > \F\ Inn then Bf does not hold; and 

• Cf does not hold. 

Proof. Fix an i and let F^, F^, . . . be all the cut-induced subsets of Ei, ordered such that q{F^) < 
q{F^) < .... Let 

JPt[AfUCf] (if g(F) < |F|lnn) 
^^ ~ \pi[BfIJCf] (if g(F) > |F|lnn). 
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By Claims [27n 12.21 and 12.31 there exists a value d = 0(l/e^) such that 

We consider the first n^ cut-induced sets. Note that for all j > 1, pj < 4n~*^''"*'^^ Therefore, a 
union bound shows that the probability that any bad event happens for some F^ with 1 < j < n^ 
is at most n^ • pi < An~'^ . 

Now we consider the remaining cut-induced sets F^ C Ei for j > n^. Corollary 12.51 states that, 
for any a > 1, 

|{ cut-induced set F O Ei : a{F) <a}\ < n^". 

Letting a > 1 be such that j = n^", we see that a{F^) > a = 2iHn^- Thus, from Eq. ()D.2p we have 



Pj 



< 4n-(''+2)°(^') < 4^-{r+2)ln(j)/21nn < ^^._(r+2)/2 



Thus 



Y.PJ ^ 


1^ «-. 


r+2)/2 


i>n^ 


j>n2 


< 


fOD 

Jrfi 




-8i-'/2 


oo 




r 


n? 


= 


0(n-'^). 





dj 



This completes the proof. 



E Proofs for Section [2721 

Claim [2T71 Let F Q Ei he a cut-induced set. Then \F\ < n^2*. 

Proof. Since F C Ei, every e G -F satisfies /cg < 2*^"*^. Since Ue < fee for every e, we obtain 
lie < 2*+i. Thus |F| = Y^edF^e < (2)2*+^- This proves the claim. ■ 

Claim 12.81 For any integer d > 0, 



Y^ \Cr\Ei\ < 2^-'^\c\. 

i<h\C\-2\gn-d 



Proof. By Claim O 



Y, \CnEi\ < Yl ^'2* ^ ^22ig|ch2ign-d+i ^ 2i-'^|C|. 

i<lg|C|-21gn-d i<lg|C|-21gn-d 

This completes the proof. 

Claim ^M Define h:R-^Rby 

h[x) 



ln(l + ^) ' 
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Then g ^(x) < h{x) for all x > 0. 

The purpose of this claim is to give a simple, asymptotically tight upper bound on g^^ . We 
thank "mathphysicist" from the web site MathOverflow for pointing out that a precise expression 
for (7~^ can be given using the Lambert W function. Specifically, one can show that 

g-\x) = exp (vr(^) + l) - 1. 



Unfortunately this exact expression is not terribly useful, since we do not know of any simple, 
asymptotically tight bounds on W . 

Proof (of Claim [2T9|) . For all x > 0, we have 

^ > ln(l + V^) 

=^ hix) = - — ■^=- > \fx 

^ ' log(l + ^) - 

=^ ln(l + /i(x)) > ln(l + Vx) 

^ h(x) . ln(l + h(x)) = '^'"^^^^^^^^ > 2x. (E.l) 

Next, for y > 0, 

1 1 V 

> 



Integrating, we obtain 



1 + y - 1 + y (l + 2/)2' 



ln(l + y) > 



1 + y 



ln(l + y) > ^(ln(l + y) 



Integrating again, we obtain 



2V ^ -' 1 + y 



il + y)Hl + y)-y> ^'^^'^'^ 



2 
Substituting y = h{x) into this and applying Eq. (jE.ip yields 

g{h{x)) = {l + h{x))\n{l + h{x))-h{x) > Hx)H'^ + Hx)) ^ ^^ 
As observed above, g~^ is strictly monotonically increasing. Thus g{h{x)) > x implies that h{x) > 

g-^x). m 



F Probability of success in Algorithm [3] 

In this appendix we complete the proof of Theorem 13.31 bv proving the following claim. 



Claim [3.71 The probability that Algorithmic outputs F is at least 
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n 



-2a 



Algorithm 4 An algorithm for finding a small cut-induced set by contracting random walks. 

procedure ContractRW(G, B, a) 

input: A graph G = (y,E), a set B <^ E, and an approximation factor a 

output: A cut-induced subset of B 

While there are more than [2a] black vertices remaining 

Randomly pick a black vertex ui with probability proportional to its degree 

Perform a random walk starting from ui and stopping when it hits a black vertex U2 

(possibly ui = U2) 

If Ul y^ U2 

Contract all edges traversed by the random walk and remove any self loops 
Pick a non-empty proper subset S* of ^ uniformly at random and output the black edges with 
exactly one endpoint in S 



Proof. Define R := [2a] . In the last iteration of the algorithm, the number of remaining vertices 
is at least R+ 1. The probability that the invariants hold at the end of the algorithm is at least 
the product of the probabilities that the invariants are not violated at any step of the algorithm. 
By Claims 13.51 and 13.61 this probability is at least 

H+l 

r=n 

where the factorial function is extended to arbitrary real numbers via the Gamma function. 

Since there are at most R remaining vertices at the end of the algorithm there are less than 
2 remaining non-trivial cuts, at least one of which induces F, by (II). Therefore, the probability 
that the last step of the algorithm selects a set S that induces F is at least 

,_R in-2a)\ Rl I 1 ,, 

{R-2a)\n\ n'^"{R-2a)l " 

where we have used the inequalities 2^^^ < Rl, n\/{n — 2a)! < n^°, and x! < 1 for x G [0, 1]. ■ 



G Random contraction algorithm by random walks 

In this appendix we present Algorithm HI which is a variant of the contraction algorithm that 
contracts random walks instead of random edges. We use the similar terminology and notation 
to Section [3l e.g., black vertices. The following analysis of the algorithm immediately implies 
Theorem [ 



2a 


n — 2a n — 1 — 2a 


R+l-2a 
R+1 


{n-2a)l R\ 


r 


n n — I 


{R-2a)\ n!' 



Theorem G.l. For any cut-induced set F Q B with q{F) < aK , Algorithmic outputs F with 
probability at least n~^". 

The approach for proving Theorem lG.il is again similar to Karger's analysis of the contraction 
algorithm — for any cut C, we can bound the probability that an edge in C is contracted. Formally, 
our analysis is: 

Lemma G.2. Consider an iteration of the while loop that begins with r remaining black vertices. 
Suppose that no edge in C has been contracted so far. Suppose that the random walk in this iteration 
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has ui ^ U2- Then 

2\C\ 

Pr[sonie edge in C is contracted in this iteration] < . 

Kr 

To prove Theorem lG.il one applies Lemma IG. 21 where C is a cut which induces F and satisfies 
\C\ < q{F) < aK. The remainder of the proof fohows by the same argument as Claim [37fl 

The key method in proving Lemma IG.2I is to understand the probability that a random walk 
hits a certain set of vertices before hitting some other set of vertices. To that end, let us introduce 
some notation. For any two sets of vertices X and Y, let condx,y denote the effective conductance 
between X and Y. Equivalently, identify X into a single vertex x, identify Y into a single vertex 
y, and let condx,y be the effective conductance between x and y. 

Next, suppose that s £ V and that T and U are disjoint subsets of V. We use s — )• T < [/ to 
denote the event that a random walk starting at s hits T before it hits U. If T = {t}, we use the 
shorthand s — )■ t < C/, and similarly if f7 = {u}. In the case that s G TUU, the term "hits" means 
"hits after performing at least one step of the random walk" . 

Remark. The event s ^ T < U can also be understood in another way. Let G' be the graph 
obtained by identifying all nodes in T into a single node t, and identifying all nodes in U into a 
single node u. Then Pr[s — )• T < [/] equals the probability that a random walk in G' starting at s 
hits t before u. 

The main tool in the proof of Lemma IG.2I is the following reciprocity law. It will allow us to 
consider random walks originating at the cut G rather than random walks that cross G. 

Lemma G.3. Let s,t €z V and C/ C y. Assume that s ^ t and {s, t} n [/ = 0. Then 

conds^{t}uu • Pr[s -^ t < U] = condt^{^}uc/ • Pr[t -^ s < U]. (G.l) 

To prove this lemma, we need to understand the relationship between the following events: 

£ = s^T <{s}UU 
£' = s^TUU <s 
£* = s^T <U. 

In English, £ is the event that the random walk hits T before hitting U or returning to s, f ' is the 
event that the random walk hits T oi U before returning to s, and £* is the same as £ except that 
the random walk is permitted to return to s before hitting T or U. 

Claim G.4. £ = £* A£'. 

Claim G.5. £* and £' are independent. 

Proof. The claim essentially follows from the "craps principle". In more detail, consider any 
random walk starting s and ending at TUU. It can be viewed as a sequence wi, . . . ,Wk of random 
walks where 

• for each i < k, Wi starts and ends at s but otherwise does not traverse s, 

• Wk starts at s and ends at T U C/ but otherwise does not traverse s. 

So Pr[iS*] is the probability that Wk ends at T, and by the Markov property, this equals 

Pr[s^r< U\s^TUU <s]. 
Thus we have argued that Pr[iS*] = Fy[£* \ £'], as required. ■ 
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Claim G.6. Fi[S'] = cond,,Tuc/ Ms). 

Proof. Doyle and Snell [131 §1-3.4]. ■ 

Proof (of Lemma fG.Sp . It is known [6l Theorem IX. 22] that 

d{s) ■ Pr[s ^t<{s}UU] = d{t) ■ Pr[t ^ s < {t} U U]. 
By Claim [GT4l and Claim [GT5| this is equivalent to 

d{s) ■ Pr[s ^{t}UU <s]- Pr[s -^ t < U] = d{t) ■ Pr[t ^{s}UU <t]- Pr[t ^ s < U]. 
By Claim IGTgI this is equivalent to (jG.ip . ■ 

Proof (of Lemma IG.2p . It is more convenient to consider hitting a vertex than a cut, so we 
subdivide every edge in C and merge the subdividing vertices into a new vertex z. We consider the 
random walk in the modified graph induced by the random walk in the original graph. The former 
walk hits z iff the latter walk intersects C. 

For the remainder of this proof, W denotes the set of currently remaining black vertices. 

Claim G.7. For any w G W, 



Pr[ui = w \ ui ^ U2] 



Proof. Let D = T^veW^i^)- Then 

Pr[ui = w A ui ^ U2] 



Pr[ni = w \ ui ^ U2] 



Pr[ni = w] ■ Pr[ni ^ U2 \ ui = w] 
T.uew Pi'[^i = ^] • P^[^i ^U2\ui=u] 

{d{w)/D) ■ Pr[w -^W\{w} <w] 
j:^^,^{d{n)/D) .Pv[u^W\{n} <u] 

d{w) ■ { cond^ ^w\^^} /d{w)) 

T.uew ^(^) ■ ( cond„^vy\{«} /d{u)) 
cond^^VF\{w} 



(by Claim [aH) 



J2uew condu,w\{u} 
This proves the claim. D 

Claim G.8. Let A and B be disjoint events. Let C be another event. 

Pr[C I A] > Pt[C \A\/B] =^ Pr[C | B] < Pr[C \ AV B]. 
Claim G.9. For any w £ W , 

Pt[w^z<W\{w] I w^W\{w]<w] < Pi[vj -^ z <W\{w}]. 
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Proof. Define 

B = w ^W\ {w} < w 
A = {w —J- z <w)\B 
C = w-^ z <W\{w}. 

Since Pr[C | .4] = 1, the hypotheses of Claim iGTSl are satisfied, and therefore 

Ft[w -^ z <W\ {w} \ w —^W\ {w} <w] < Pt[w -^ z <W\ {w} \ w —^ {W \ {w}) U {z} < w]. 

By Claim iGTSl the latter quantity equals Pr[t(; -^ z < W \ {w}]. D 

Now, we analyze the probability that the random walk hits the cut. We condition on the event 
ui 7^ tt2) since that is the only case when the algorithm contracts edges. 

Pr [random walk hits z \ ui ^ U2] 
= y Pr[ni = w \ ui ^ U2] • Pr[ random walk hits z \ ui = w f\ ui ^ U2\ 

= y^ Pr[ni = w\ui^ U2]- Pr[ w ^ z <W\ {w} \ w ^W\ {w} < w ] 
wew 

^ sr^ c(m^^^^w^ Pr[u;^z< VF\{w} \w^W\{w}<w] (by Claim [Gil) 

^ l^uaw cond„,iy\W 

< y cond^,^\{^} _ p^^^ ^z<W\ {w}] (by ClaimElD 

< > :^^ -^-^ — ^-^-^^ -VtVw ^ z <W\{w}] 

^ E^eiy cond„,iy\W 

= y ^^ "-^^^^ pr[^ ^y,^w\ {w}] (by Lemma EU 

^^ y Pr[z^^<W^\M] 

^ V ' 

(= 1, because the walk is ergodic) 

cond^ w 



< 



2\C\ 



K\W\ 



The last inequality is because the node z has degree 2|C|, and every node u ^W \s connected to 
some node v ^W \ {u} by a black edge, and co\\.diu,w\{u} ^ cond^,!, > K. ■ 



H Algorithms for constructing sparsifiers 

In this section, we sketch the algorithms as stated in Theorems 11.71 11-81 and 11.91 They are simple 
modifications of the algorithms of Bencziir and Karger [5]. The main difference is that Bencziir 
and Karger's algorithms compute approximate edge strengths whereas our modifications compute 
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approximate edge connectivities. The sparsifiers we obtain have shghtly larger size but the al- 
gorithms are simpler and more efficient because approximating the edge connectivities is quite a 
bit easier than approximate the edge strengths. Our algorithms can be easily implemented, and 
furthermore they can be used as a preprocessing step for computing smaller sparsifiers. The proofs 
of correctness of these algorithms are almost exactly the same as the proofs in [5J . 

H.l Finding 0{nlog^ n)-size sparsifiers for graphs with polynomial weights 

We now present an algorithm that computes a sparsifier of size 0(n log n). It runs in 0{m) time 
for unweighted graphs and 0{'m + nlog n) time for graphs with polynomial weights. Recall that, 
in Theorem ll.il it is sufficient to find Kg that is a lower bound of the edge connectivity ke- 

The main tool that we use is the k-certificate introduced by Nagamochi and Ibaraki [35]. Given 
a multigraph G = {V, E), they partition E into a set of forests Fi,F2, . . . in the follwing way. Let Fi 
be a maximal forest of G and for i > 1, let Fi he a maximal forest of the subgraph G — Uj<i Fj. Each 
Fi is called a Nl-forest. Nagamochi and Ibaraki showed that for any integer k > 0, H^ = L)j<_kFj, 
the union of the first k NI- forests, preserves all cuts of G that have size at most k. Thus H^ contains 
all {k + l)-light edges of G (an edge e is A;- heavy ii ke > k and A;-light otherwise). Hf^ is called a 
/c-certificate. Clearly, it contains at most k{n — 1) edges. 

Nagamochi and Ibaraki [35j presented an algorithm for labeling every edge e with a label r^., 
such that if an edge e has multiplicity Ue > 1, the Ue copies of e are contained in the Ue NI- forests 
Fr^-ue+i, ■ ■ ■ iFre- For simple graphs, it runs in 0{m) time. For multigraphs, a slightly modified 
version |2T] of this algorithm runs in 0{m + nlogn) time. 

Let e = st be an edge. Note that if e appears in Fj, then s and t must be connected in Fj for 
every j < i, for otherwise e can be added to Fj, which contradicts the maximality of Fj. Therefore, 
Te is a lower bound of k^ and we can set Kg = r^- Suppose we sample with probability p^ = l/^e as 
described in Theorem ll.il Then with high probability, this will produce a sparsifier that preserves 
every cut to within a 1 ± e factor. Since we assume the weights are polynomially bounded, the 
expected number of edges per round is 

•n? poly(n) 'n? poly{n) 

Y,Ue/re< Yl J^Vi<(^-l) Y. l/i = 0(nlogn). 

eG-E j=l e^Fj j=l 

Therefore with high probability, the sparsifier contains 0(n log n/e^) edges. 

Using the Nagamochi-Ibaraki Certificate algorithm, all Ke can be found in 0{m + nlogn) 
time. For each edge e, we can decide in expected constant time whether at least one copy of 
e is included in the sparsifier. For an edge that has at least one copy in the sparsifier, we can 
find the sum of weights of all copies of it by sampling from the distribution Binomial(ue/5, Pe) 
instead of sampling Ue Bernoulli random variables per round. This sampling is easy to do in 
0{uePPe) = 0{ue\og njke^^ = 0(log n/e^) time, for each edge included in the sparsifier. We 
suspect that this can be improved using the technique of Knuth and Yao [29] [HI Ch. 15], but leave 
the details to future work. Therefore, the total running time is 0{rn + nlog n/e^) for graphs with 
polynomial weights. 

For unweighted graphs, we can reduce this to 0{m) time. Note that for unweighted graphs, 
Ue = 1 for all e € E. If an edge e has p/ke > 1, instead of performing p rounds of sampling, we 
can include e with probability 1 and assign it a weight of 1. Thus we can sample the weight of an 
edge in expected constant time. This change can only decrease the expected size of the sparsifier 
and the sampled weight of every cut can only be more concentrated around its mean. 
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Algorithm 5 Algorithm for estimating edge connectivities. 



procedure ConnectivityEstimation(/7,A;) 
input: subgraph H oi G 
E' ^ Partition(i?,2A;) 
for each e & E' 

for each nontrivial connected component H' C H — E' 
ConnectivityEstimation(i7' ,2k) 



We remark that recent work of Hariharan and Panigrahi [20] analyzes the same algorithm 
and shows that actually setting p = 0(log(n)/e^) is sufficient, whereas we set p = 0(log^(n)/e^). 
Therefore the size of their sparsfier is only 0(n log (n)/e^). 

H.2 Finding 0{nlog n)-size sparsifiers for graphs with polynomial weights 

In this section, we describe another algorithm for computing k^ which has the advantage that the 
computed Kg's satisfy YleeE'^e'^e ~ 0{n). By the last statement of Theorem ll.il the size of the 
sparsifier would be 0(n log n). The algorithm, given in Algorithm O is a slight variation of the 
Estimation algorithm in [5]. 

The algorithm is based on finding fc-partitions. A k -partition of a graph G = {V,E) is a set 
E' (1 E of edges that includes all {k + l)-light edges such that \E'\ < 2k{r — 1) if G — £" has r 
components. A /c-partition is a "sparser" version of A:-certificate as a /c-certificate can have k{n — 1) 
edges. 

Lemma H.l (Bencziir and Karger [5]). There is an algorithm Partition that outputs a k-partition 
in 0{m) time for unweighted graphs and 0{m\ogn) time for graphs with arbitrary weights. 

We compute the k^s by using the ConnectivityEstimation procedure below. It is almost the 
same as the Estimation procedure in f5j, which is for finding lower bounds of edge strengths. The 
only difference is that they call a WeakEdges procedure to find the /c-weak edges instead of calling 
Partition to find A;-light edges. 

Lemma H.2. After a call to ConnectivityEstimation(G ,1 ) , all the labels Ke satisfy k^ < k^,. 
Proof. The proof is the same as the proof of Corollary 4.8 in [5]. ■ 

Lemma H.3. The values Kg output by ConnectivityEstimation satisfy ^ l/ztg = 0{n). 

Proof. The proof is the same as the proof of Lemma 4.9 in |5]. ■ 

Lemma H.4. ConnectivityEstimation runs in 0{m\og n) time on a graph with polynomial 
weights. 

Proof. For a graph with polynomial weights, the maximum connectivity is bounded by some fixed 
polynomial, so there are at most O(logn) levels of recursion. The total number of edges in all the 
input graphs to Partition at each level of recursion is at most m. By Lemma IH.H each level of 
computation takes O(mlogn) time. ■ 

Suppose we sample as described in Theorem 11.11 By Theorem 11.11 and Lemma 111.31 the size 
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of the sparsifier is 0(nlog n/e^). The time needed for finding k^s is 0(7nlog n) and the time 
needed for finding the weight of every edge is 0(n log n/e^). Therefore, the total running time is 
0(?7ilog n + nlog n/e^). 

H.3 Finding 0(r2logr2)-size sparsifiers for graphs with polynomial weights 

In [S], Bencziir and Karger presented an algorithm that finds a sparsifier of size 0(n log n/e^) for 
graphs with arbitrary weights in 0(?n, log n) time. This can be combined with the algorithms in 
the last two sections to prove Theorem II. 91 

Suppose we are given a graph G. First we apply Theorem 11.71 to find a sparsifier G' which 
approximately preserves all cuts of G to within a multiplicative error of 1 it e/4. G' has size 
0(n log n/e^) and this takes 0{m + nlog n/e^) time. Then we apply Theorem 11.81 to find a 
sparsifier G" which is a 1 ± e/4-approximation of G' . G" has size 0(n log n/e^) and this takes 
0(n log n • log n/e^ + nlog n/e^) = 0(n log n/e'*) time. Finally, we use Bencziir and Karger's 
algorithm to obtain a sparsifier G'" that is a 1 it e/4-approximation of G" . G'" has size 0(n log n/e^) 
and this takes 0(n log n/e^ • log n) = 0(n log n/e^) time. 

Note that G'" approximately preserves all cuts of G to within a multiplicative error of 1 it e. 
The total running time is 0{m + nlog n/e^). 
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